Assessing naive Bayes and support vector machine performance in sentiment classification on a big data platform
Authors
Abstract
Mining user reviews has become a very useful means of decision making in several areas. Traditionally, machine learning algorithms have been widely and effectively used to analyze users' opinions on a limited volume of data. In the case of massive data, powerful hardware resources (CPU, memory, storage) are essential for dealing with the whole set of data processing phases, including collection and pre-processing, in an optimal time. Several big data technologies have emerged to process massive data efficiently, like Apache Spark, which is a distributed framework that provides libraries implementing machine learning algorithms. In order to evaluate the performance of Spark's machine learning library (MLlib) on large volumes of data, the classification accuracies and processing time of two machine learning algorithms implemented in Spark, naive Bayes and support vector machine (SVM), are compared with the performance achieved by the standard implementation of these two algorithms on datasets of different sizes built from movie reviews. The results of our experiment show that the performance of the classifiers running under Spark is higher than that of the traditional ones and reaches an F-measure greater than 84%. At the same time, we found that, under the Spark framework, the processing time is relatively low.
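The abstract reports classifier quality as an F-measure. As a minimal sketch of how that score is usually derived from predictions on a binary sentiment task, the following assumes the balanced F1 form (beta = 1) and hypothetical "pos"/"neg" labels; the function name `f_measure` and the toy data are illustrative and not taken from the paper.

```python
def f_measure(y_true, y_pred, positive="pos", beta=1.0):
    """Return the F-beta score for the `positive` class.

    Assumption: labels are plain strings and the task is binary;
    the paper's exact evaluation setup is not given in the abstract.
    """
    pairs = list(zip(y_true, y_pred))
    # Count true positives, false positives and false negatives.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # Precision and recall are both zero (or undefined); report 0.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta * beta
    # Weighted harmonic mean of precision and recall.
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical gold labels and predictions for six movie reviews.
y_true = ["pos", "pos", "pos", "neg", "neg", "neg"]
y_pred = ["pos", "pos", "neg", "pos", "neg", "neg"]
score = f_measure(y_true, y_pred)
```

Because the F-measure combines precision and recall, it penalizes a classifier that achieves high accuracy only by favoring one class, which is why it is a common choice for comparing sentiment classifiers.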
Similar resources
Sentiment Analysis Technique: A Look into Support Vector Machine and Naive Bayes
Sentiment analysis and opinion mining aim to analyze sentiments, opinions, emotions etc. towards products, services or current topics. Various approaches are applied to mine the sentiments portrayed. Supervised machine learning is one such approach that is generally applied. The aim of this paper is to investigate the current methods used to perform sentiment analysis by reviewing and co...
Naive-Bayes for Sentiment Classification
This report details the findings in building a naive Bayes sentiment classifier for an IMDB movie-review data set using Scala and ScalaNLP. We studied the unigram or bag-of-words Bernoulli and Multinomial models and a number of different feature selection techniques, including term frequency, mutual information and Chi-squared. 1. DATA CORPUS The corpus consists of 2000 rated movie reviews, compr...
A Novel Technique for Fingerprint Classification based on Naive Bayes Classifier and Support Vector Machine
Fingerprint classification decreases the number of possible matches in automated fingerprint identification systems by categorizing fingerprints into predefined classes. Support vector machines are widely used in pattern classification and have produced high accuracy when performing fingerprint classification. In order to effectively apply support vector machines to multi-class fingerprint clas...
Forecasting Stock Price Movements Based on Opinion Mining and Sentiment Analysis: An Application of Support Vector Machine and Twitter Data
Today, social networks are fast and dynamic communication intermediaries that are a vital business tool. This study aims at examining the views of those involved with Facebook stocks so that we can summarize their views to predict the general behavior of this stock and collectively consider possible Facebook stock price movements, and create a more accurate pattern compared to previous patterns...
Quantum support vector machine for big data classification.
Supervised machine learning is the classification of new data based on already classified training examples. In this work, we show that the support vector machine, an optimized binary classifier, can be implemented on a quantum computer, with complexity logarithmic in the size of the vectors and the number of training examples. In cases where classical sampling algorithms require polynomial tim...
Journal
Journal title: IAES International Journal of Artificial Intelligence
Year: 2021
ISSN: 2089-4872, 2252-8938
DOI: https://doi.org/10.11591/ijai.v10.i4.pp990-996